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Abstract 

Background: Glandular trichomas are production and storage organs of specialized metabolites such as terpenes, 
which play a role in the plant's defense system. The present study aimed to shed light on the regulation of terpene 
biosynthesis in Solanum lycopersicum trichomes by identification of transcription factors (TFs) that control the 
expression of terpene synthases. 

Results: A trichome transcriptome database was created with a total of 27,195 contigs that contained 743 
annotated TFs. Furthermore a quantitative expression database was obtained of jasmonic acid-treated trichomes. 
Sixteen candidate TFs were selected for further analysis. One TF of the MYC bHLH class and one of the WRKY class 
were able to transiently transactivate 5. lycopersicum terpene synthase promoters in Nicotiana benthamiana leaves. 
Strikingly, SIMYCl was shown to act synergistically with a previously identified zinc finger-like TF, Expression of 
Terpenoids 1 (SIEOTl) in transactivating the SITPS5 promoter. 

Conclusions: High-throughput sequencing of tomato stem trichomes led to the discovery of two transcription 
factors that activated several terpene synthase promoters. Our results identified new elements of the transcriptional 
regulation of tomato terpene biosynthesis in trichomes, a largely unexplored field. 
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Background 

Specialized glandular trichomes can produce and accu- 
mulate large quantities of terpenoids, phenylpropanoids, 
flavonoids and alkaloids, which they can also secrete [1]. 
RNA sequencing in combination with metabolite profile 
analysis of glandular trichomes and proteomics have 
shed light on the biosynthesis of specialized metabolites 
in the trichomes of various plant species [2]. Through 
the production of EST libraries, micro-arrays and high- 
throughput sequencing of (glandular) trichome RNA, genes 
have been identified that are involved in the terpenoid, 
phenylpropanoid, alkaloid and flavonoid biosynthesis in 
various plant species, including tomato [3-5], sweet basil 
[6,7], tobacco [8,9], mint [10], alfalfa [11], Artemisia annua 
[12] and hop [13]. Although EST sequencing has been 
instrumental in the discovery of enzymes of trichome- 
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specialized metabolism [4], next generation sequencing 
(NGS) can give a more in-depth picture of transcriptomes. 
NGS technologies (i.e. RNA sequencing) has been used for 
characterization of several trichome transcriptomes, for ex- 
ample from plants of medical importance like Artemisia 
annua (Asteraceae; [12]) or Huperzia serrata and Phleg- 
mariurus carinatus (Huperziaceae; [14]). NGS has also 
been used for gene discovery, for example in combination 
with shotgun proteomics and metabolite analysis of tomato 
{Solanum lycopersicum) trichomes, leading to the discovery 
of the leaf-trichome-specific |3-caryophyllene/a-humulene 
synthase (CAHS; [4]). NGS of trichomes RNA from wild 
and cultivated tomato varieties led to the discovery and 
characterization of various sesquiterpene synthases, provi- 
ding insight into the evolution of terpene synthases [15]. 

Terpene biosynthesis in tomato plants is of major 
interest because terpenes play an important role in the 
plant's defense [16-20]. The sequencing of the cultivated 
tomato genome has enabled the characterization of its 
terpene synthase (TPS) gene family [21,22], but not much 
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is known about the regulation of the terpenoid pathway. 
Transcriptional control of biosynthetic genes is a major 
mechanism by which secondary metabolite production is 
regulated [23,24]. 

There are not many transcription factors (TFs) known 
to be involved in regulation of terpenoid pathways. 
ORCA3, a jasmonate-responsive APETALA2 (AP2)-do- 
main transcription factor from Catharanthus roseus, has 
been shown to regulate expression of Strictosidine Synthase 
(STR) involved in terpene indole alkaloid biosynthesis [25]. 
Subsequently, a methyl-jasmonate (MeJA)-inducible tran- 
scription factor of the MYC family {CrMYC2) was shown 
to positively regulate 0RCA3 [26]. CrWRKYl was identi- 
fied as being involved in the root-specific accumulation of 
serpentine in C. roseus plants and as being induced by phy- 
tohormones including JA [27]. This TF appeared to nega- 
tively regulate 0RCA3 and to a lesser extend CrMYC2 
[27]. A MeJA-inducible WRKY transcription factor from 
Gossypium arboreum that regulates the sesquiterpene syn- 
thase {+)-S-cadiene synthase A in cotton fibers was iden- 
tified by Xu et al. [28]. Ma et al. [29] demonstrated that a 
MeJA-inducible WRKY transcription factor from Artemisia 
annua is involved in the regulation of artemisinin bio- 
synthesis. More recendy two JA-responsive AP2 family 
transcription factors from A. annua (AaERFl and 2) were 
found to regulate Amorpha-4,ll-diene synthase {ADS}, a 
sesquiterpene synthase involved in the biosynthesis of arte- 
misinin [30] whereas Lu et al. [31] identified AaORA, a 
AP2/ERF TF, that regulates several genes in the artemisinin 
biosynthetic pathway including ^fl£7?fi. Most recently, the 
MeJA-inducible Arabidopsis thaliana MYC2 transcription 
factor [32] was shown to regulate sesquiterpene synthases 
AtTPS21 mdAtTPSll [33]. 

Here, we used NGS of tomato stem trichomes as a 
tool for gene discovery. First, a transcript database was 
created from normalized cDNA, which was mined for 
transcription factors. Then, in order to narrow down the 
number of TFs potentially involved in terpene biosynthesis, 
an expression profiling database was created using lUumina 
sequencing of trichome RNAs from plants treated with or 
without jasmonic acid (JA), since J A is known to induce 
terpene emission in tomato and to regulate several terpene 
synthases [16,21,34,35]. To identify TFs that regulate 
terpene biosynthesis we used a transient assay based on 
the transactivation of tomato terpene synthase promoters 
in planta. 

Results 

Assembly of RNAseq data and Genome Analyzer II 
transcript profiling 

We created a tomato trichome EST database by sequen- 
cing a mixture of glandular and non-glandular trichome 
RNAs, derived from stems of Solanum lycopersicum cv. 
Moneymaker plants. The resulting cDNA was normalized 



prior to being used as input for 454 GS FLX Titanium 
pyrosequencing. A full plate was sequenced consisting of 
two halves: one with cDNAs originating from control 
plants and the other half with cDNAs originating from 
plants treated with JA. In total we obtained 979,076 high- 
quality reads with an average length of 337 bp. The reads 
from control and JA-treated samples were assembled de 
novo resulting in 27,195 contigs with an average length of 
931 bp, leaving 24,187 reads unmatched (singletons), with 
an average length of 241 bp. Nucleotide sequences of the 
contigs were blasted against the Solanaceae Genomics 
Network (SGN) tomato database for annotation, using a 
local E-Blast tool; 3,295 contigs were not annotated. 

For creating the transcript profiling databases with 
Genome Analyzer II, the same RNA material as for the 
454 sequencing was used, but this time the cDNA derived 
from control and JA-treated stem trichomes was not 
normalized before being processed. We specifically ob- 
tained 5,631,975 3' sequences from the Control sample 
and 5,882,547 from the JA-treated sample. 4,840,738 and 
5,169,891 reads from the Control and JA-samples, respect- 
ively, were mapped to one unique contig of the trichome 
database. In addition, 38,699 (C) and 45,375 (JA) reads 
were mapped to multiple contigs and 791,237 (C) and 
712,656 (JA) remained unmapped. 

Both the 454 GS FLX Titanium reads and the Genome 
Analyzer II reads can be found in the Sequence Read 
Archive of NCBI (http://www.ncbi.nlm.nih.gov/sra) under 
accession number SRP041373. 

Annotation, gene ontology and protein families 

In order to characterize the S. lycopersicum stem trichome 
transcriptome the unique contigs (27,195 ESTs) were sub- 
mitted to homology searches (BLASTX) in the National 
Center for Biotechnology Information (NCBI) non-redun- 
dant protein database using Blast2GO [36]. 4,733 contigs 
did not return a BLASTX hit. The majority of the top hits 
were to protein sequences of Vitis vinifera, followed 
by Populus trichocarpa, Ricinus communis and Solanum 
lycopersicum. 

Next, gene ontology (GO) and enzyme classifications 
(EC) were performed in order to classify the ESTs. It 
must be noted that one sequence could be assigned to 
more than one GO term. For the cellular component 
class the assignments were mostly given to cell and organ- 
elle (54,82% and 29,35% respectively; Additional file 1: 
Figure Sla). The highest percentage of molecular function 
GO terms were in binding and catalytic activity (42,96% 
and 41,38% respectively; Additional file 1: Figure Sic). In 
the biological processes, the majority of the GO terms was 
grouped into two categories- those of metabolic and cellu- 
lar process (36,55% and 32,79% respectively; Additional 
file 1: Figure Sib). Finally, within the predicted ECs, 
the prevailing categories of enzymes were transferases 
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and oxidoreductases (31,38% and 29,65% respectively; 
Additional file 1: Figure Sid). 

The search of additional databases for protein families, 
domains, regions and sites was performed from Blast2GO 
via the InterPro EBI web server. The 30 top InterPro en- 
tries obtained are presented in Table 1. The most domin- 
ant class of enzymes was protein kinases. Abundantly 
represented were also cytochrome P450s. 

Finally, within Blast2GO, the EC numbers were clas- 
sified in KEGG pathways, enabling the presentation of 
enzymatic functions in the context of the metabolic 
pathways in which they are part of (Blast2GO Tutorial, 
[37]). Among the pathways identified, the ones related 
to secondary metabolism are shown in Table 2. Lipid 
transfer proteins represented 0.19% of the tomato stem 
trichome transcripts. 



Transcripts of enzymes involved in jasmonic acid bio- 
synthesis and signaling pathways were also identified in 
the trichome database. Data for a selection of enzymes 
are presented in Table 3, including known JA marker 
genes such as LOXA (U09026), AOC (AW624058; [38]), 
JAZl (EF591123; [39]) and JAZ3 (EU194561; [40]). 

A closer look was taken at the terpene biosynthesis 
pathway (Figure 1) in order to see if the precursor path- 
ways were up-regulated by JA. As shown in Table 4, ex- 
pression of some precursor genes in tomato was induced 
by JA although not strongly (max induction ~2.5-fold 
for HDS). As in other plants [41], genes encoding enzymes 
of the precursor pathways can belong to small gene families 
and it appears that expression levels and JA-inducibiUty of 
these members can vary. Transcript abundance of precur- 
sor genes is presented in Table 4 for comparison with the 



Table 1 Summary of the most common InterPro entries found In the S. lycopersicum stem trichome transcrlptome 



InterPro 


Frequency 


Description 


rnU 1 lUUy 


j/ 


Protein l<in3se-lil<e domain 


rnUUU/ 1 y 


521 


Protein l<inase, catalytic domain 


IrKUUzzyU 


336 


Serine/threonine-/dual-specificity protein l<inase, cata ytic domain 


DDnnQ T7 1 
rKUUoz/ 1 


286 


Serlne/threonine-protein l<inase, active site 


rKUzUojj 


Z03 


Tyrosine-protein kinase, catalytic domain 


DDm ^^n^ n 
rnU I0U4U 


263 


NAD(P)-binding domain 


r KU 1 jUoj 


z jy 


7inr finrior D 1 MT /PV\/P /DW Pi t\/r^o 

Zinc iinyei, niiNo/r i vt/rriu-Lype 


DDm 7/1/11 
r nU 1 / 1 


1 P7 

1 o/ 


Protein kinase, ATP binding site 


PR002885 


181 


Pentatricopeptide repeat 


IPR015943 


175 


WD40/Y\/rN repeat-like-containing domain 


IPR001841 


172 


Zinc finger, RING-type 


IPR012677 


172 


Nucleotide-binding, alpha-beta plait 


IPR016024 


166 


Armadillo-type fold 


IPR001245 


164 


Serine-threonine/tyrosine-protein kinase catalytic domain 


IPR000504 


158 


RNA recognition motif domain 


IPR001680 


153 


WD40 repeat 


IPR011046 


152 


WD40 repeat-like-containing domain 


IPR011990 


147 


Tetratricopeptide-like helical 


IPR011989 


145 


Armadillo-like helical 


IPR001128 


141 


Cytochrome P450 


IPR017986 


133 


WD40-repeat-containing domain 


IPR017853 


130 


Glycoside hydrolase, superfamily 


IPR012287 


119 


Homeodomain-related 


PR001611 


115 


Leucine-rich repeat 


PR009057 


115 


Homeodomain-like 


PR016196 


112 


Major facilitator superfamily domain, general substrate transporter 


PR012336 


108 


Thioredoxin-like fold 


PR013781 


107 


Glycoside hydrolase, subgroup, catalytic domain 


PR002213 


102 


UDP-glucuronosyl/UDP-glucosyltransferase 


IPR002401 


95 


Cytochrome P450, E-class, group 1 
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Table 2 KEGG pathways related to biosynthesis of secondary metabolites found in the S. lycopersicum stem trichome 
transcriptome 

KEGG pathway 

Terpenoid biosyntliesis 



Plienylpropanoid biosyntliesis 



Flavonoid biosyntliesis 



AII<aloid biosynthesis 



Steroid biosynthesis 



EC nr 


Enzyme name 


Nr of seq 


eel. 1.1. 208 


(+)-neomenthol dehydrogenase 


1 


ec:41.1.33 


diphosphomevalonate decarboxylase 


3 


ec:2.2.1.7 


1 -deoxy-D-xylulose-5-phosphate synthase 


2 


ec:1.17.1.2 


4-hydroxy-3-methylbut-2-enyl diphosphate reductase 


1 


ec:2.7.7.60 


2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase 


1 


ec:2.5.1.1 


dimethylallyltranstransferase 


1 


eel. 17.7.1 


{E)-4-hydroxy-3-methylbut-2-enyl-diphosphate synthase 


1 


ec:2.7.1.148 


4-(cytidine 5'-diphospho)-2-C-methyl-D-erythritol kinase 


1 


eel. 1.1. 267 


1-deoxy-D-xylulose-5-phosphate reductoisomerase 


1 


ec:1.1.1.34 


hydroxymethylglutaryl-CoA reductase (NADPH) 


6 


ec:2.5.1.31 


ditrans.polycis-undecaprenyl-diphosphate synthase 


4 


ec:2.3.3.10 


hydroxymethylglutaryl-CoA synthase 


6 


ec:5.3.3.2 


isopentenyl-diphosphate Delta-isomerase 


2 


ec:2.5.1.32 


phytoene synthase 


2 


ec:2.1. 1.104 


caffeoyl-CoA 0-methyltransferase 


4 


ec:l. 11.1.7 


peroxidases 


56 


ec:3.2.1.21 


beta-glucosidase 


7 


ec:2.1.1.68 


caffeate 0-methyltransferase 


1 


eel. 1413.11 


trans-cinnamate 4-monooxygenase 


1 


ee6.2.1.12 


4-coumarate — CoA ligase 


4 


ee2.1. 1.104 


caffeoyl-CoA 0-methyltransferase 


4 


ee2.3.1.74 


naringenin-chalcone synthase 


2 


eel. 141 1.23 


flavonol synthase 


7 


eel. 141 3.88 


flavonoid 3',5'-hydroxylase 


3 


eel. 141 1.9 


flavanone 3-dioxygenase 


1 


eel. 1.1.219 


dihydrokaempferol 4-reductase 


2 


eel. 1413.21 


flavonoid 3'-monooxygenase 


3 


eel. 1413.11 


trans-cinnamate 4-monooxygenase 


1 


ee5.5.1.6 


chalcone isomerase 


1 


ec:43.3.2 


strictosidine synthase 


2 


ee41.1.28 


aromatic-L-amino-acid decarboxylase 


4 


eel. 141 1.20 


deacetoxyvindoline 4-hydroxylase 


1 


ec:2.6.1.42 


branched-chain-amino-acid transaminase 


3 


eel. 14.21. 6 


lathosterol oxidase 


1 


ee2.1.141 


sterol 24-C-methyltransferase 


1 


ee2.5.1.21 


squalene synthase 


2 


ec:5.3.3.5 


cholestenol Delta-isomerase 


1 


ee2.1.1.6 


catechol 0-methyltransferase 


1 


eel. 1.1. 145 


3beta-hydroxy-Delta5-steroid dehydrogenase 


2 


eel. 14141 


unspecific monooxygenase 


1 


eel. 3.99.5 


3-oxo-5alpha-steroid 4-dehydrogenase 


4 
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Table 3 List of selected enzymes involved in the jasmonic acid biosynthesis and signaling 



Abbreviation 


Contig Nr 


SGN Nr 


Annotation 


Transcript length 
(bases) 


Expression 
values J a" 


Expression 
values C" 


Fold JA 
induction 


LOXA^ 


6402 


SGN-U592535 


Lipoxygenase A 


2837 


35.2 


1.6 


22 


AOC" 


24817 


SGN-U562649 


Allene oxide cyclase 


1645 


1 898.8 


417.6 


4.5 


JAZi'' 


6863 


SGN-U579837 


Jasmonate ZiM-domain 1 


1156 


12.2 


0.7 


174 


JAZ3^ 


20751 


SGN-U564446 


Jasmonate ZiM-domain 3 


986 


18.87 


2.3 


8.2 


con" 


24353 


SGN-U568988 


Coronatine-insensitive 1 


2260 


11 


8.5 


1.29 



=[38]: LOXA (U09026), AOC (AW624058), COIl (NM_001 247535), "[39]: JAZ1 (EF591 123), '[40): JAZ3 (EU 194561). 
*Expression vaiues are in RPKM (reads per kilobase of transcript per miliion mapped reads). 



expression levels of the 13 terpene synthases (TPSs) found 
in stem trichomes Table 5). 

Selection of transcription factors potentially involved In 
regulating terpene synthases 

Based on the annotated contigs 743 transcription factors 
of different classes were found in the trichome database: 
69 WRKY, 151 MYB, 8 MYC, 52 bZIP, 9 ARF, 71 ERF, 
17 ZnF, 28 bHLH, 12 MADS, 1 NAC and 325 of unlcnown 
function/class. Out of those, 151 were up-regulated (>1.5x) 
by the treatment with JA, 119 were down-regulated (<0.67x) 
and expression of 473 TFs remained unaltered. Since JA is 
known to play a role in the plant's direct and indirect 
defenses we were interested in those transcription factors 
that were induced by JA and could therefore potentially 
be involved in up-regulating terpene biosynthesis. 56 of 
the TFs that were up-regulated by JA showed an induction 



higher than 2-fold. The sequence of these 56 TFs was 
blasted against the tomato genomic sequence (Solanaceae 
Genomics Network, SGN) and complete ORFs were con- 
structed when possible (GENSCAN, [46]), if not provided 
by the RNAseq. These sequences were submitted to hom- 
ology search after translation against the NCBI database 
for identifying conserved domains. From this analysis 16 
TFs (Table 6) were selected for further investigation as 
follows: we focused on classes of TFs involved in the regu- 
lation of terpenoids identified so far in other plant species- 
namely TFs of the APETALA2 class [25,30,31], WRKY class 
[27-29] and MYC class [26,33]. In total eleven transcription 
factors of the AP2 class, four of the WRKY class and one 
of the MYC class, although it only showed a 1.4-fold in- 
duction, were selected for further investigation of their 
potential involvement in regulating expression of terpene 
synthases. 




Figure 1 Enzymes involved in the precursor biosynthesis for mono-(CIO), sesqui-(CIS) and di-(C20) terpenes. An explanation of the 
abbreviations used in the pathways and the GAII reads for each enzyme are shown in Tables 4 and 5, respectively. ER; endoplasmic reticulum, TPS; 
terpene synthase, SPS; solanesyl diphosphate synthase. The grey oval circles between organelles and at the cell membrane represent putative 
transporter systems. Peroxisomal localization of precursor enzymes of the MVA pathway has been previously reported in Arabidopsis thaliana and 
Catharanthus roseus [42-44]. 
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Table 4 Enzymes involved in the biosynthesis of precursors of mono-, sesqui- and diterpenes 



Abbreviation 


Name 


Chr 


SGN nr 


Expression 
values JA 


Expression 
values C 


Fold JA 
induction 


AACT 


acetoacetyl-coenzyme A thiolase 


5 


SGN-U566720 


351.04 


275.21 


1.27 






7 


SGN-U566719 


126.3 


133.52 


0.94 


HMGS 


3-hydroxy-3-methylglutaryl-CoA synthase 


8 


SGN-U579858 


14.95 


15.98 


0.93 






8 


SGN-U578388 


205.34 


123.57 


1.56 


HMGR 


3-hydroxy-3-methylglutaryl-CoA reductase 


2 


SGN-U580675 


2.2 


1.06 


2.07 






2 


SGN-U578017 


19.2 


22.19 


0.86 






3 


SGN-U579319 


43.35 


24.5 


1.77 


MVK 


mevalonate kinase 


1 


SGN-U567385 


1 98.89 


125 


1.59 


pMVK 


phosphomevalonate kinase 


8 


SGN-U583971 


46.92 


54.51 


0.86 


MDC 


mevalonate dipliosphate decarboxylase 


4 


SGN-U587221 


34 


3.19 


1.06 






11 


SGN-U581971 


26.58 


35.51 


0.75 


DXS1 


1-deoxy-D-xylulose 5-phosphate synthase 1 




SGN-U567647 


2414 


3533 


0.68 


DXS2 


1-deoxy-D-xylulose S-phosphate synthase 2 


11 


SGN-U582996 


37.23 


22.72 


1.54 


DXR 


1-deoxy-D-xylulose S-phosphate reductoisomerase 




SGN-U585813 


657.7 


497.34 


1.32 


MCT 


4-diphosphocytidyl-2-C-methyl-D-erythritol synthase 




SGN-U566797 


107.95 


121.45 


0.89 


CMK 


4-diphosphocytidyl-2-C-methyl-D-erythritol kinase 




SGN-U583224 


275.9 


237.57 


1.16 


MDS 


2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase 




SGN-U568497 


36.72 


41.73 


0.88 


HDS 


l-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


11 


SGN-U567167 


3043 


12.25 


248 


HDR 


l-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate reductase 


^ 


SGN-U580558 


2916.93 


2720.54 


1.07 


IDI 


isopentenyl diphosphate isomerase 


5 


SGN-U569721 


34 


1.95 


1.74 






4 


SGN-U577516 


9574 


749.1 1 


1.28 


GPS 


geranyl diphosphate synthase 


8 


SGN-U573523 


2.71 


3.35 


0.8 


NDPS (CPT1) 


neryl diphosphate synthase 


8 


SGN-U583641 


2810.51 


4595.4 


0.6 


FPS 


farnesyl diphosphate synthase 


12 


SGN-U580757 


11.05 


9.75 


1.13 






10 


SGN-U578686 


0.58 


1.24 


0.55 






10 


SGN-U581576 


15.98 


25.21 


0.53 


GGPS 


geranyl geranyl diphosphate synthase 


4 


SGN-U571085 


6.12 


7.45 


0.82 






9 


SGN-U575882 


63.58 


55.05 


0.96 






2 


SGN-U573348 


17.17 


34.8 


049 


CPT3 


c/5-prenyl transferase 3 


3 


SGN-U572901 


1445 


17.58 


0.82 


CPT4 


c/5-prenyl transferase 4 


10 


SGN-U568982 


9.52 


7.28 


1.3 


CPT5 


c/5-prenyl transferase 5 


10 


SGN-U585528 


120.17 


70.3 


1.7 


CPT7 


c/5-prenyl transferase 7 


6 


SGN-U574892 


291.53 


471.59 


0.52 



Chr, chromosome. 

*Expression values are in RPKM (reads per kilobase of transcript per million mapped reads). 
For an overview of the biosynthetic pathway see Figure 1. 



Tissue specificity and JA responsiveness of selected 
transcription factors 

The sixteen candidate TFs should ideally be trichome- 
specifically expressed and possibly induced by jasmonic 
acid. In order to investigate the expression pattern of 
these genes, cDNA was synthesized from different S. 
lycopersicum cv. Moneymaker organs and tissues: leaves, 
stems, isolated stem trichomes and roots from 4-week-old 
plants, as well as flowers and fruit of mature plants. In 



Figure 2 transcript levels, as determined by Q-RT-PCR, 
are presented for four of the sixteen selected transcription 
factors. For the other twelve candidate TFs expression in 
the trichomes was much lower than that in the other 
organs/tissues and these were excluded from further ana- 
lysis. TF SlMYCl (KF430611) was predominately expressed 
in trichomes, but also in leaves and flowers (Figure 2a). 
SIWRKY78 was expressed in leaves, trichomes, roots and 
flowers (Figure 2b). SIWRKY28 was a trichome-specific 
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Table 5 Terpene synthases (TPS) found in S. lycopersicum 
stem trichomes 



1 r J 


Trsnscript l6nQth 
(bases) 


Expression 
values JA" 


Expression 
values C" 


rOia JM 

induction 


3 


2099 


15.8 


2.3 


6.87 


5 


2186 


103.18 


46.7 


2.2 


7 


1069 


0.5 


0.18 


2.78 


9 


2011 


3060.75 


2359.03 


1.3 


12 


407 


4.08 


2.66 


1.53 


16 


1868 


18.36 


16.87 


1.09 


17 


1190 


4.25 


3.55 


1.2 


19 


776 


44.7 


37.46 


1.19 


20 


1148 


142.79 


106.89 


1.33 


24 


854 


0.17 


0.18 


0.94 


31 


1991 


0.68 


0.18 


3.78 


39 


1131 


12.07 


5.68 


2.12 


41 


2368 


71.57 


66.58 


1.07 



*Expression values are in RPKM (reads per kilobase of transcript per million 
mapped reads). 

For SITPS8 no transcripts were identified in Xhe stem trichome database 
although) by Q-RT-PCR minimal expression has been observed [45]. 



gene (Figure 2c) and SIWRKY73 was expressed in tricho- 
mes, roots and fruit (Figure 2d). Q-RT-PCR analyses 
indicated that none of the selected transcription factors 
was significantly induced by JA according (Figure 2). 
SIWRKY73 expression appeared to be approximately 1.7- 
fold reduced in JA treated plants (p = 0.07). 

SIMYCI and SIWRKY73 can transactivate terpene synthase 
promoters in Nicotiana benthamiana leaves 

In order to investigate whether these TFs could activate 
a selection of terpene synthase promoters, a transient 
assay in Nicotiana benthamiana leaves was used, which 
has been previously shown to work for the interaction 
between the zinc finger-like transcription factor Expres- 
sion of Terpenoids 1 (SlEOTl) and the SITPSS promoter 
[47]. In the reporter construct, expression of p-glucuroni- 
dase {uidA, GUS) is driven by the glandular trichome- 
specific promoter of SITPSS. Co-infiltration with the 35S: 
SlEOTl effector construct resulted in transactivation of the 
SITPSS promoter, leading to GUS expression in this heter- 
ologous system (Figure 3). As negative control for the 
effector, a 35S:RFP construct was used. Various other 
reporter constructs with promoters of other terpene 
synthases- SITPSS, SITPS7 and SITPSS- driving expression 
of GUS or a GUSsYFPl flision {SITPS9) were included in 
the analyses. 



Table 6 List of selected S. lycopersicum transcription factors (TF) potentially involved in terpene biosynthesis 



Name 


Contig nr 


SGN nr 


Annotation 


Transcript 
length (bases) 


Expression 
values JA" 


Expression 
values C* 


Fold JA 
induction 


SIAP2_9 


83 


SGN-U572361 


ERF (ethylene response factor) subfamily 
B-3 of ERF/AP2 TF family 


371 


9.69 


4.08 


2.37 


SIAP2_2 


1719 


SGN-U596590 


DREB subfamily A-1 of ERF/AP2 TF family 


422 


13.6 


1.95 


6.97 


SIAP2_6 


5289 


SGN-U563871 


AP2 domain-containing TF 


933 


45.39 


12.78 


3.55 


SIAP2_3 


7031 


SGN-U563215 


DREB subfamily A-1 of ERF/AP2 TF family 


981 


5.95 


1.42 


419 


SIAP2_7 


7865 


SGN-U587768 


DREB subfamily A-4 of ERF/AP2 TF family 


775 


2.2 


0.71 


3.09 


SIAP2_4 


10714 


SGN-U586437 


ERF (ethylene response factor) subfamily 
B-3 of ERF/AP2 TF family 


611 


2.72 


0.71 


3.83 


SIAP2_10 


14672 


SGN-U585539 


AP2 domain-containing TF 


297 


18.87 


7.99 


2.36 


SIAP2_5 


16204 


SGN-U577088 


ERF (ethylene response factor) subfamily 
B-4 of ERF/AP2 TF family 


523 


0.58 


0.18 


3.78 


SIAP2_1 1 


25582 


SGN-U584756 


ERF (ethylene response factor) subfamily 
B-2 of ERF/AP2 TF 


660 


1.19 


0.53 


2.24 


SIAP2_1 


25985 


SGN-U586438 


ERF (ethylene response factor) subfamily 
B-3 of ERF/AP2 TF family 


942 


16.32 


0.53 


30.8 


SIAP2_8 


26482 


SGN-U581852 


Ethylene-responsive element-binding 
factor 4 homolog 


788 


58.81 


24.32 


242 


SIWRKY22 


9827 


SGN-U565154 


WRKY family TF 


1366 


0.68 


0.18 


3.78 


SIWRKY28 


10561 


SGN-U584367 


WRKY family TF 


1326 


0.5 


0.18 


2.78 


SIWRKY78 


13200 


SGN-U565157 


WRKY family TF 


1099 


1.19 


0.18 


6.61 


SIWRKY73 


20918 


SGN-U571278 


WRKY family TF 


1453 


0.5 


0.18 


2.78 


SIMYCI 


24332 


SGN-U576396 


IVIYCTF 


2174 


3247 


22.7 


1.43 



•Expression values are in RPKM (reads per kilobase of transcript per million mapped reads). 
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( ^ 

(See figure on previous page.) 

Figure 2 Tissue specific expression and JA induction of selected TFs. Transcript levels for (a) SIMYCl (b) SiWRKY78, (c) SIWRKY28 and (d) 
SIWRKY73 as determined by Q-RT-PCR. Mean values (+SE) of 3 biological replicas are shown, normalized for Actin expression. L; leaf W5; whole 
stem, S5; bald stem, T; stem trichomes, fi; root, Fr; fruit, Fi; flower; C; control and J/1; jasmonic acid induced stem trichomes. ns; not significant 
according to 7"-test. 



As shown in Figure 3a, S1WRKY73 could transactivate 
the SITPSS promoter, albeit to a lower extent than SlEOTl. 
S1WRICY73 transactivated the SITPSS and SITPS7 pro- 
moters only weakly, and the SITPSS and SITPS9 promoters 
not at all (35S:RFP negative controls shown in Additional 
file 1: Figure S2). S1WRKY78 or S1WRKY28 did not trans- 
activate any of the terpene synthase promoters (Additional 
file 1: Figure S3). 

SIMYCl could transactivate all terpene synthase pro- 
moters tested except SITPSS. Transactivation of the 
trichome-specific SITPSS and SITPSS promoters was 
strongest (Figure 3b; 35S:RFP negative control shown 
in Additional file 1: Figure S2). However, it should be 
noted that GUS activity of a promoter driving the GUS- 
sYFPl fusion was lower than when the same promoter 
driving GUS alone was transactivated by an effector 



construct (data not shown), possibly because the fusion 
protein was less stable or produced. Therefore, transacti- 
vation by SIMYCl of the trichome-specific SITPS9 pro- 
moter was potentially stronger than that detected here. 

SIMYCl and SlEOTl act synerglstlcally on the SITPSS 
promoter In N. benthamiana leaves 

Since SlEOTl, SIMYCl and S1WRKY73 were shown in 
separate experiments to be able to transactivate the 
SITPSS promoter (Figure 3), we investigated what effect 
a combination of these transcription factors would have 
on the transactivation of this promoter. To this end, 
Agrobacterium cultures carrying the CaMV 35S -driven 
effector constructs were mixed in pairs or all three 
together and combined with the SlTPSSp: GUS re- 
porter construct and infiltrated in N. benthamiana leaves 
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+ 


SITPS5p:GUS 


35S:MYC1 


+ 


SirPS9p:GUSsYFPl 
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Figure 3 Transactivation of terpene synthase promoters by SIIVIYC1 and SIWRKY73 in N. benthamiana leaves. Normalized GUS activity after 
co-infiltration with A. tumefaciens harboring the (a) 35S:WRKY73 or (b) 35S:MYC1 effector construct and various promoterGUS reporter constructs. 
The 35S:SIE0T1 and 35S;RFP effector constructs were used as positive and negative control, respectively. The bars represent the obtained mean 
values and the error bars the standard error (n = 3). RFP; red fluorescent protein. Letters indicate significant differences (ANOVA, P < 0.05 according 
to Tuckey's B posthoc test). Representative results from three experiments are shown. 
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(Figure 4). Interestingly, co-expression of SlEOTl and 
SlMYCl almost tripled the transactivation of SITPSS pro- 
moter compared to the effect of each TF alone. Adding 
S1WRKY73 did not have an additional effect, but rather 
seemed to have a negative effect on the combinatorial 
action of the other two TFs, although not at a statistically 
significant level (Figure 4). 

Discussion and conclusions 

RNA sequencing of S. lycopersicum stem trichomes led 
to the identification of one MYC bHLH and one WRKY 
transcription factor that can transactivate several terpene 
synthase promoters. The observation that SlMYCl acts 
synergistically with SlEOTl in the transactivation of the 
SITPSS promoter suggests a complex regulatory network 
for terpene biosynthesis. 

High-throughput sequencing of Solanum lycopersicum 
stem trichomes 

We used massive parallel pyrosequencing on the 454 GS 
FLX Titanium platform to sequence S. lycopersicum stem 
trichome RNAs with the goal to identify transcription fac- 
tors involved in terpene biosynthesis. We used normalized 
cDNA to maximize representation of low abundant tran- 
scripts and reduce representation of highly abundant 
transcripts. Attempts to map the obtained reads to the 
publicly available mixed tissue SGN database led to a high 
percentage of unmapped reads and assignment of the 
same reads to multiple unigenes and therefore the reads 
were assembled de novo. 2.5% of the reads could not be 
matched and were not used in further analysis. 87.9% of 
the resulting contigs were subsequently annotated after 
blasting against the SGN tomato database using a local 
E-Blast tool. In this database we identified annotated en- 
zymes involved in several metabolic pathways (Additional 
file 1: Table SI). In short, compared to the study published 
by McDowell and colleagues [3] on S. lycopersicum cv. 
M82 trichomes, we identified in Moneymaker trichomes 



cDNAs encoding enzymes involved in for example 
the TCA cycle, starch and sucrose metabolism (Additional 
file 1: Table SI), as well as secondary metabolite biosyn- 
thesis (Table 2). Photosynthesis related genes were also 
identified but were not as prevalent (Additional file 1: 
Table SI) as in M82 trichomes. Such differences could 
originate from the fact that in our study we used a mix of 
Moneymaker trichome types, including stalks, whereas 
McDowell and colleagues focused on comparing different 
types of trichomes between Solanum species and so 
clipped off and analyzed only the secretory cells of glandu- 
lar trichomes [3]. 

Furthermore we created an expression profiling data- 
base using Illumina sequencing in order to obtain genes 
regulated by JA. The success of the JA treatment is evi- 
dent by the high induction of known JA markers, some 
of which are presented in Table 3 (LOXA, AOC [38]; 
JAZl [39]; JAZ3 [40]). 

Jasmonic acid regulation of the terpene biosynthesis 
pathway in tomato trichomes 

In order to investigate whether in stem trichomes of 
tomato Moneymaker plants, regulation of terpene bio- 
synthesis by JA is also on the precursor level besides on 
the level of individual TPSs [21], the quantitative data- 
base was mined for enzymes of the precursor pathways. 
The copy number of these genes varies between different 
plant species [41] and, as shown in Table 4, different 
family members can vary in their expression levels and/or 
JA-inducibility. For example 1-deoxy-d-xylulose 5-phos- 
phate synthase (DXS), in contrast with Arabidopsis, which 
contains a single functional gene, has diversified into two 
isogenes in other plant species including tomato [48]. 
Whereas SlDXSl is ubiquitously expressed, SIDXS2 is 
expressed only in a few tissues and in leaf trichomes its 
transcript abundance is much higher than that of SlDXSl 
[49], although this is not the case in stem trichomes 
(Table 4). Furthermore, SIDXS2 is moderately induced by 



I 3Ss|^>clT(xlor| I promotCT >reponer I 



35S:E0T1 + 35S:MYCI + 35S:WRKY73 + SITPS5p:GLIS 

35S:MYCI + 35S:WRKY73 + SITPS5p:GUS 

35S:EOTI + 35S:WRKY73 + SITPS5p:GUS 

35S:EOTI +35S:MYC1 + SITPS5p:0US 
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35S:EOTI + SITPS5p:GUS 

35S:RFP + SITPS5p:GUS 



0 2 4 6 8 10 12 14 16 18 
normalizcil GUS aclivily (GUSiLUC) 

Figure 4 Transactivation of SITPSS promoter by SlEOTl, SlMYCl, SIWRKY73 or combination thereof in N. benthamiana leaves. Normalized 
GUS activity after co-infiltration with A tumefaciens harboring the 35S:EOT1, 35S:MYC1, 35S:WRKY73 effector constructs or combination thereof 
and the SITPS5p:GUS reporter construct The 35S:RFP effector construct was used as negative control. The bars represent the obtained mean 
values and the error bars the standard error (n = 4). RFP; red fluorescent protein. Letters indicate significant differences (ANOVA, P< 0.05 according 
to Tuckey's B posthoc test). Representative results from two experiments are shown. 
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wounding in the cultivar Moneymaker [49], which corre- 
lates with the observed moderate induction of SIDXS2 by 
JA (~1.6-fold, Table 4). SIDXS2 expression is also appro- 
ximately threefold upregulated in the tomato cultivar 
Castlemart upon feeding by Manduca sexta larvae [50] . 

The regulation of precursor genes of the MEP pathway 
by wounding, hormones or elicitors has been demonstra- 
ted in various plant species [49-54] . Similarly, evidence for 
the regulation of precursor biosynthesis of the mevalonate 
(MVA) pathway is also abundant [55-60]. For example, 
HMGR enzyme activity and protein level were shown to 
increase by fungal infection in potato tubers and sweet 
potato root [59]. Furthermore, HMGRl expression was 
induced by treatment with MeJA in potato, whereas 
HMGR2 expression was reduced [56]. In response to cat- 
erpillar herbivory, transcripts of HMGRl were reduced in 
alfalfa [60]. Our results show that in tomato stem tricho- 
mes HMGRl and HMGR3 were induced by JA treatment 
approximately 2-fold, whereas expression of HMGR2 
remained unaltered (Table 4). None of the prenyl diphos- 
phate synthases were induced in tomato trichomes by 
JA treatment, whereas two seemed to be downregulated 
(FPS, SGN-U578686; and GGPS, SGN-U573348; Table 4). 
We did not find any transcripts for GGPSl (SGN-U574849) 
in our stem trichome database, although it has been shown 
to be induced in tomato leaves by JA-treatment [61]. Finally, 
from the very recently identified cw-prenyltransferases 
only CPTS, that produces medium-length chain polyiso- 
prenoids [62], was upregulated by JA, 1.7-fold (Table 4). 

Identification of transcription factors involved in 
regulating terpene synthases in tomato trichomes 

Our primary aim was to identify transcription factor(s) 
that regulate terpene biosynthesis. Based on the anno- 
tated contigs, 2.7% of the transcripts in the tomato stem 
trichomes encode transcription factors. For comparison, 
in Arabidopsis thaliana ~6% of the genes in all tissues 
encode TFs (TAIRIO genome release, [63]). Since JA is 
essential for establishing indirect defense responses in 
tomato [34,35] and the induction of terpene synthases in 
trichomes [16,21], we hypothesized that TFs involved in 
the regulation of terpene biosynthesis would also be JA- 
inducible genes. Most of the transcription factors known 
to be involved in regulation of terpenoid pathways are 
jasmonate-inducible and of the APETALA2, WRKY or 
MYC class [25-30,33]. However, in Arabidopsis it was 
recently shown that two MYC transcription factors 
(AtMYC3 and AtMYC4), which act additively with 
AtMYC2 in the activation of JA responses, are, in contrast 
to AtMYC2, only marginally induced by JA treatment 
[64]. Based on all the above, the initial selection of tran- 
scription factors to be analyzed from our quantitative stem 
trichome database was limited to TFs of the AP2, WRKY 
and MYC class that showed a 2-fold or higher induction 



by JA treatment (2.2-fold was the induction rate of control 
gene SlMTSl; [16], renamed SITPS5 [21]; Table 5). None 
of the MYC transcription factors of our database showed 
induction higher than 2, so for further analysis the closest 
homolog of AtMYC2 [32] was selected, as it has been 
shown to activate the AtTPSll and AtTPS21 promoters 
[33]. After discarding TFs that were not trichome-specific 
or did not show highest expression in trichomes, the list 
was narrowed down to four candidate transcription fac- 
tors. According to the Q-RT-PCR data however, none 
of these TFs was significantly induced by JA treatment 
(Figure 2). Since the numbers of sequence reads of these 
genes is very low both in the Control and JA samples 
(Table 6), the fold-induction in the lUumina experiments 
must have been overestimated. 

SIMYC1 and SIWRKY73 transactivate terpene synthase 
promoters in planta 

A specific indication of whether any of these TFs are 
involved in regulating terpene biosynthesis would be the 
activation of terpene synthase promoters by the transcrip- 
tion factor. In transient activation assays in N. benthami- 
ana leaves two of the four selected transcription factors 
were able to transactivate at least one terpene synthase 
promoter. S1WRKY73 showed strongest transactivation of 
the SITPSS promoter and in lesser extent of the SITPS3 
and SITPS7 promoters (Figure 3a). Although SIWRKY73 
is expressed highly in roots (Figure 2), S1WRKY73 could 
not transactivate the promoter of SITPSS that is mainly 
expressed in roots. It could also not transactivate the 
trichome-specific sesquiterpene synthase SITPS9 pro- 
moter so it is possible that S1WRKY73 can transactivate 
only monoterpene synthases or at least not the sesqui- 
terpene synthase tested here (Figure 3a). As shown in 
Figure 5 SIWRKY73 and the respective TPSs that it can 
transactivate are co-expressed in various tissues where the 
regulation could take place in the plant. 

SlMYCl showed strongest transactivation of SITPSS 
and SITPS3 and to a lesser extent of SITPS7 and SITPS9 
but no transactivation of SITPSS promoter (Figure 3b), 
although SlMYCl is also expressed in the root, albeit 
not strongly (Figure 2). As shown in Figure 5 SlMYCl is 
expressed (at different levels) in every plant tissue and 
SlMYCl is able to activate all the terpene synthase pro- 
moters tested except one, so it seems to be a regulator 
of multiple TPSs, in contrast to SlEOTl that is only 
expressed in the glandular trichomes and can specifically 
transactivate the SITPSS promoter and none of the other 
TPS promoters tested (Figure 5, [47]). The other two 
selected TFs (S1WRKY78 and S1WRKY28; Additional 
file 1: Figure S3) were not able to significantly transacti- 
vate any of the tested terpene synthase promoters. However 
it cannot be excluded that these TFs were not expressed in 
the transient assay. 
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Figure 5 Expression patterns and activation overview of TFs and TPSs. Putative positive interaction of tine respective SITPS promoters by 
transcription factors (a) SIMYCl, (b) SIWRKY73 and (c) SIEOTl is indicated by a colored box that represents the tissue in which they are co-expressed. 
Expression in the various tissues is indicated by +++, ++, +, +/- and - according to Q-RT-PCR values. Darker shaded boxes indicate a stronger transient 
activation of the TPS promoter by the TF in N. benthamiana leaves. EOTl; Expression of Terpenoids 1 [47]. 



One question that arises is, of course, where S1WRKY73 
and SIMYCl bind on these terpene synthase promoters. In 
the promoter sequence of SITPSS, SITPSS and SITPS7 [47] 
there are five, four and one W-boxes (TGAC(C/T)) re- 
spectively (PLACE; [65], Additional file 1: Table S2), which 
could serve as potential binding site(s) for S1WRKY73. 
Furthermore, SITPSS promoter contains two G-box-like 
elements (CACATG instead of the canonical CACGTG), 
one T/G-box element (AACGTG) and one T/G-box-like 
element (TACGTG) (Additional file 1: Table S2), which 
could potentially be the binding site(s) of SIMYCl. The 
promoter of SITPSS, with which SIMYCl interacts less 
strongly, contains one G-box-like element and one T/G- 
box element (Additional file 1: Table S2). The SITPS7 
promoter, which SIMYC can also activate, contains one 
T/G-box (Additional fUe 1: Table S2). The SITPS9 pro- 
moter [47] however, does not contain any of these 
elements, which could indicate the existence of an unchar- 
acterized motif to which SIMYCl binds. When using the 
motif search program MEME [66] with all four promoters 
that SIMYCl can activate, one 8 bp motif was identified in 



the plus or minus (for SITPS9) orientation: CTAGG(T/A) 
(A/G)G. The validation of a (putative) regulatory element 
as the binding site for these TFs would require extensive 
further experimentation. However, since our transactiva- 
tion assays do not indicate direct binding, the TF-TPS 
promoter interactions observed in the ATTAs, could take 
place through an additional protein. To address the issue 
of which terpene synthases (and possibly other genes as 
well) these TFs regulate, we are currently starting the 
more laborious but more conclusive approach of creating 
stably transformed silenced and overexpressing plants. 

SIMYCl acts synergistically with SIEOTl In the 
transactivatlon of the SITPSS promoter 

Interestingly, SIEOTl and SIMYCl acted synergistically 
in the transactivatlon of the SITPSS promoter (Figure 4). 
Combinatorial control of transcriptional regulation is 
commonly found in plants and other eukaryotes [67]. 
For example, in abscisic acid (ABA) signaling, the 67 bp 
promoter region of the dehydration-responsive gene 
rd22 contains a MYC and a MYB recognition site, where 
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AtMYC2 and AtMYB2 can bind, respectively. In Arabi- 
dopsis leaf protoplasts it was shown that these TFs could 
individually activate transcription of ^-glucuronidase driven 
by this 67 bp promoter region of rd22 and that the transi- 
ent activation was stronger when AtMYC2 and AtMYB2 
were combined [68]. Transgenic plants overexpressing 
these TFs each showed ABA hypersensitivity but the effect 
was more profound in plants overexpressing both TFs [69]. 

Given the fact that SlMYCl and SlEOTl are not in- 
duced by JA (Figure 2, [47]) and yet the proteins can 
transactivate the JA-inducible SITPSS promoter indicates 
that they could be regulating the steady-state transcription 
of SITPSS. These TFs might however also be involved in 
the enhanced SITPSS expression by interacting with other, 
inducible TF{s). From the well-studied cases of trans- 
criptional regulation in Catharanthus roseus [25-27] and 
Arabidopsis [32,33,64] it has becomes clear that it usually 
involves a network of TFs. In Solatium lycopersicum we 
are only just starting to unravel the complexity of tran- 
scriptional regulation of terpene biosynthesis. 

Methods 

Hormone treatment and RNA isolation 

Tomato plants {Solarium lycopersicum cultivar Money- 
maker) were grown in soil in a greenhouse with day/night 
temperatures of 23°C/18°C and a 16/8 h light/dark regime 
for four weeks. They were then sprayed either with JA 
solution (1 mM JA; Duchefa, NL, in tap water + 0,05% 
SilwetL-77; GE Silicones, VA, USA) or with control solu- 
tion (0,05% SilwetL-77 in tap water). Stem pieces were 
collected 30 min, 2 h, 8 h and 24 h later for pyrosequenc- 
ing or 24 h later for expression analyses and trichomes 
were isolated by shaking the stems in liquid nitrogen. 
Total RNA was isolated using TRIzol (Invitrogen, Paisley, 
UK) according to the manufacturer's instructions. Equal 
amount of trichome RNA from the different time points 
was pooled creating the control (C) and JA samples. RNA 
used for pyrosequencing was then purified on a RNeasy 
Plant column (Qiagen, Valencia, CA, USA). 

Transcriptome database construction 

RNA quality was determined with the Agilent RNA pico 
chip (Agilent Technologies, Waldbronn, Germany). Syn- 
thesis and amplification of cDNA was performed using 
the SMART PGR cDNA Synthesis and Advantage 2 PGR 
kits (Glontech Inc., GA, USA) according to the manufac- 
turer's instructions with some modifications of adapters 
to eliminate 3' poly( A) -stretches prior to sequencing. 
cDNA quality was determined with the Agilent DNA 
7500 chip (Agilent Technologies, Waldbronn, Germany) 
or on an 1% agarose/EtBr gel. Normalization of the cDNA 
was carried out using the Evrogen TRIMMER kit (Evrogen, 
Moscow, Russia) according to the manufacturer's protocol. 
The normalization efficiency was determined both on an 



agarose/EtBr gel (1%) and with an Agilent DNA 7500 chip. 
The cDNA was purified and concentrated using the Qia- 
quick PGR purification kit (Qiagen, Valencia, GA, USA). 
cDNA shearing and FLX Titanium library preparation 
was carried out using the Roche GS FLX Titanium Gen- 
eral Library Preparation Method kit (Roche Diagnostics, 
Mannheim, Germany) according to the manufacturer's 
protocol. The size range of the fragments was determined 
with an Agilent DNA 1000 chip (Agilent Technologies, 
Waldbronn, Germany). Exclusion of smaller-sized frag- 
ments was performed using the double SPRI method as 
described in the Roche GS FLX Titanium General Library 
Preparation protocol (Roche Diagnostics, Mannheim, 
Germany). End-polishing, small fragment removal, library 
immobilization, fill-in reaction and single-stranded library 
isolation was performed using the GS FLX Titanium Gen- 
eral Library Preparation Method kit (454 Life Sciences, 
Roche Diagnostics, Mannheim, Germany) according to 
manufacturer's instructions. 

Expression profiling database construction 

Starting from the same total RNA samples (G and JA, see 
above), mRNA was amplified and purified using the Mes- 
sageAmp II aRNA Amplification kit (Applied Biosystems/ 
Ambion, CA, USA) according to manufacturer's instruc- 
tions. RNA quality was determined with the Agilent RNA 
pico chip (Agilent Technologies, Waldbronn, Germany). 
Synthesis of cDNA was performed using the MessageAmp 
II aRNA Amplification kit (Applied Biosystems/ Ambion, 
GA, USA) according to manufacturer's instructions with 
modifications of the adapters to enable sequencing of 3' 
cDNA ends. cDNA was purified with the Qiaquick 
PGR purification kit (Qiagen, Valencia, CA, USA). cDNA 
quality was determined with the Agilent DNA 7500 chip 
(Agilent Technologies, Waldbronn, Germany) or on an 
1% agarose/EtBr gel. Shearing and ligation was carried out 
using standard lUumina PE adapters containing a specific 
sample ID tag. Adapter-ligated cDNA fragments were 
column purified with the Qiaquick PGR purification kit 
(Qiagen, Valencia, CA, USA). The size range of the frag- 
ments was determined with an Agilent DNA 1000 chip 
(Agilent Technologies, Waldbronn, Germany). Exclusion 
of smaller-sized fragments was performed using a single 
SPRI procedure as described in the Agencourt Ampure 
PGR Purification protocol (Agencourt Bioscience Corpor- 
ation, MA, USA). The size range of single-stranded frag- 
ments was determined with an Agilent RNA pico 6000 
chip (Agilent Technologies, Waldbronn, Germany). Ex- 
pression profiling was performed using the lUumina Gen- 
ome Analyzer II System (lUumina, USA). 

Databases assembly, EST annotation and homology searches 

The 454 sequencing reads (Control and JA combined) were 
assembled into contigs de novo by Vertis Biotechnologie 
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AG, Germany using the CLCbio software [70]. Nucleotide 
sequences of the contigs were then blasted against the SGN 
unigenes v2 tomato database (ftp.solgenomics.net/unigene_ 
builds/combined_species_assemblies/tomato_species) for 
annotation, using a local Eblast tool {E value le-9). The 
GA II reads (Control and JA separately) were mapped to 
the annotated contigs of the 454 sequencing trichome 
database by Vertis Biotechnologie AG, Germany. 

The resulting contigs were also imported in the bioinfor- 
matics tool Blast2GO v.2.5.0 [37] and were blasted against 
the National Center for Biotechnology Information (NCBI) 
non-redundant protein database BLASTX (£ value le-3). 
Further analyses with this tool included functional annota- 
tion by Gene Ontology (GO) terms and Enzyme Commis- 
sion numbers (EC code), InterPro terms (InterProScan; 
[71]) and metabolic pathways (Kyoto Encyclopedia of 
Genes and Genomes, KEGG; [72]). 

cDNA synthesis and quantitative-real time-PCR 

DNA was removed from RNA with DNAse (Ambion, 
Huntingdon, UK) according to the manufacturer's ins- 
tuctions and cDNA was synthesized from 1.5 |ig RNA 
using M-MuLV Reverse Transcriptase (Fermentas, 
St. Leon-Rot, Germany). For Q-RT-PCR, cDNA equiva- 
lent to 100 ng total RNA was used as template in 20 \i\ 
volume and reactions were performed in the ABI 7500 
Real-Time PCR System (Applied Biosystems) using the 
Platinum SYBR Green qPCR SuperMix-UDG kit (Invi- 
trogen. Paisley, UK) with the following cycling program: 
2 min 50°C, 7 min 95°C, 45 cycles of 15 sec at 95°C and 
1 min at 60°C, followed by a melting curve analysis. 
Primer pairs were tested for amplification kinetics and 
linearity with a standard cDNA dilution curve and new 
primers were designed if necessary. Expression levels 
were normalized using ACTIN (SGN-U579547) mRNA 
levels. Effects of JA on gene expression were analyzed in 
three biological replicates by T-test using PASW Statistics 
17.0 [73]. The homogeneity of variance was tested by 
Levene's test. 

Cloning, construct design and ATTAs 

TFs SlMYCl (KF430611; sequence of the full-length 
ORE obtained from the 454 trichome database), SIWRKY28 
and SIWRKY73 ([74]; Additional file 1: Figure S4) were 
cloned between restriction sites Ncol (at the ATG) and Sad 
(at the 3 ' end of the sequence) in front of the Nos termin- 
ator in vector pKG1662 (KeyGene, Wageningen, NL; for a 
map of the vector see patent nr US2011/0113512A1) 
driven by the CaMV 35S promoter. TF SIWRKY78 ([74]; 
Additional file 1: Figure S4) was cloned downstream of 
the CaMV 35S promoter in vector pJVII, a pMON999- 
based vector (Monsanto, St. Louis, MO) with a modified 
multiple cloning site (MCS), between restriction sites Xbal 
(at the ATG) and BsrGl (at the 3' end of the sequence). 



All constructs were verified by sequencing and then the 
expression cassettes containing 35S promoter, cDNA of 
interest and nos terminator were transferred to the MCS 
of the binary vector pBINplus [75] between Hindlll and 
Smal restriction sites. The final constructs were trans- 
formed to Agrobacterium tumefaciens GV3101 (pMP90). 
The promoter:GUS constructs used in the transient trans- 
activation assay have been described elsewhere [47]. The 
A. tumefaciens transient transactivation assay (ATTA) was 
performed as described in Spyropoulou et al., [47]. 

Additional file 



Additional file 1: Figure SI. Gene ontology (GO) and enzyme 
classifications (EC) for S. lycopersicum stem trichome transcriptome at level 2. 
(a) Cellular component GO terms, (b) biological process GO terms, 
(c) molecular function GO terms and (d) general EC terms. Figure S2. 
Transactivation ofterpene synthase promoters by 35S:RFP in N. benthamiana 
leaves. Letters indicate significant differences (n = 4, ANOVA, P < 0.05 
according toTuckey's B posthoc test). The normalized GUS activity of the 
SITPS3, 7, 8 and 9 reporter construas with the RFP effector construa is not 
significantly higher from the SITPS5 reporter construct with the RFP effector 
construct, indicating that any relevant activation of an effector construct 
(in Figures 3, 4, and Additional file 1 : Figure S3) must be significantly higher 
than that of the SITPS5p:GUS reporter- 35S:RFP effector combination. 
Figure S3. Transactivation of terpene synthase promoters by SIWRKY78 and 
SIWRKy28 in N. benthamiana leaves. Letters indicate significant differences 
(n = 3, ANOVA, P < 0.05 according to Tuckey's B posthoc test). Representative 
results from two experiments are shown. The normalized GUS activity of 
the 35S:WRKY28 effector- SITPS5p:GUS reporter construct combination 
was only marginally higher than that of the negative control (35S:RFP 
effector- SITPS5p:GUS reporter constructs) and was not further investi- 
gated. Figure S4. Nucleotide sequence of transcription factors SIWRKY78 
(Solyc07g055280.2.l), SIWRKY28 (Solyc 1 2g0 1 1200.1.1), SIWRKY73 
(SolycOSgl 13120.21) and SlMYCl (KF43061I). The predicted coding 
sequences are in capital letters, 5' and 3' UTRs are in small letter type. 
Start and stop codons are in bold. Table SI. KEGG pathways found in 
the S. lycopersicum stem trichome transcriptome. Table S2. Selected 
regulatory motifs in the sequence of SITPS5, 3 and 7 promoters analyzed 
by PLACE [65]. Table S3. List of primers used. 
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